KMeanCLustering32.dll does not support multithreading, so expect a performance penalty.
It also uses the x87 math coprocessor (FPU) instead of SSE2 for floating-point math, so it
runs a bit slower, but it does not require a CPU with SSE2. NOTE: the code for this DLL was
modified to compile with Visual C++ 6.0.
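
If you are unsure whether a machine needs this non-SSE2 build, the CPU can be queried
directly. The following is only a minimal sketch and is not part of the DLL: the function
names edx_has_sse2 and cpu_has_sse2 are made up for illustration, and it assumes an x86
target built with a compiler newer than Visual C++ 6.0 (MSVC with <intrin.h>, or GCC/Clang
with <cpuid.h>). It relies on CPUID leaf 1 reporting SSE2 in bit 26 of EDX.

```cpp
#if defined(_MSC_VER) && (defined(_M_IX86) || defined(_M_X64))
#include <intrin.h>   // __cpuid (not available in Visual C++ 6.0)
#define SKETCH_CPUID_MSVC 1
#elif (defined(__GNUC__) || defined(__clang__)) && \
      (defined(__i386__) || defined(__x86_64__))
#include <cpuid.h>    // __get_cpuid
#define SKETCH_CPUID_GNU 1
#endif

// Hypothetical helper: decode the SSE2 feature flag from the EDX value
// returned by CPUID leaf 1. SSE2 is bit 26 of EDX.
bool edx_has_sse2(unsigned int edx) {
    return ((edx >> 26) & 1u) != 0;
}

// Hypothetical helper: returns true if the running CPU reports SSE2.
// On non-x86 targets or unsupported compilers it conservatively returns false.
bool cpu_has_sse2() {
#if defined(SKETCH_CPUID_MSVC)
    int regs[4] = {0, 0, 0, 0};          // EAX, EBX, ECX, EDX
    __cpuid(regs, 1);
    return edx_has_sse2(static_cast<unsigned int>(regs[3]));
#elif defined(SKETCH_CPUID_GNU)
    unsigned int eax = 0, ebx = 0, ecx = 0, edx = 0;
    if (!__get_cpuid(1, &eax, &ebx, &ecx, &edx)) {
        return false;                    // CPUID leaf 1 not supported
    }
    return edx_has_sse2(edx);
#else
    return false;
#endif
}
```

If cpu_has_sse2() returns false, the CPU cannot run SSE2 code, which is the case this
build of the DLL is meant to cover.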